Abstract

Severe acute respiratory syndrome (SARS) is an emerging infectious disease associated with a novel coronavirus and causing 
worldwide outbreaks. SARS coronavirus (SARS-CoV) is an enveloped RNA virus, which contains several structural proteins. 
Among these proteins, spike (S) protein is responsible for binding to specific cellular receptors and is a major antigenic determinant, 
which induces neutralizing antibody. In order to analyze the antigenicity and receptor-binding ability of SARS-CoV S protein, we 
expressed the S protein in Escherichia coli using a pET expression vector. After isopropyl-β-D-thiogalactoside induction, S
protein was expressed in the soluble form and purified by nickel-affinity chromatography to homogeneity. The amount of S protein
recovered was 0.2-0.3 mg/100 ml bacterial culture. The S protein was recognized by sera from SARS patients in ELISA and Western
blot, which indicated that recombinant S protein retained its antigenicity. By biotinylated ELISA and Western blot using biotin-labeled
S protein as the probe, we identified 130-kDa and 140-kDa proteins in Vero cells that might be the cellular receptors responsible
for SARS-CoV infection. Taken together, these results suggested that recombinant S protein exhibited antigenicity and
receptor-binding ability and could be a good candidate for the further development of SARS vaccines and anti-SARS therapies.

© 2003 Elsevier Inc. All rights reserved. 
Severe acute respiratory syndrome (SARS) is a newly
emerging human disease that has resulted globally in 774 deaths
from 8098 probable cases [1]. A novel coronavirus has
been identified as the etiological agent of SARS and
designated SARS coronavirus (SARS-CoV) after
tests of causation according to Koch’s postulates,
including monkey inoculation [2-4]. SARS-CoV can infect
African green monkey kidney (Vero E6) cells and cause
a similar disease in cynomolgus macaques (Macaca
fascicularis) [2,3]. The full-length genome sequence of
SARS-CoV was elucidated within weeks after the
identification of this novel pathogen [5,6]. SARS-CoV
contains a single-stranded plus-sense RNA genome
about 30 kb in length that has a 5'-cap structure and a
3'-polyadenylation tract. The genomic organization is
typical of coronaviruses, having five major open reading
frames (ORFs) that encode the replicase polyproteins;
the spike (S), envelope (E), and membrane (M)
glycoproteins; and the nucleocapsid protein (N) in the same
order as those of other coronaviruses [5-7].

When coronaviruses enter cells, the 5'-region of the viral
genome is translated into a large polypeptide that is
cleaved by virus-encoded proteases to release
RNA-dependent RNA polymerase and adenosine
triphosphatase/helicase. These proteins, in turn, are responsible for
replicating the viral genome as well as generating nested
transcripts that are used in the synthesis of viral
proteins. Viral membrane proteins, including S and M, are
inserted into the endoplasmic reticulum (ER), while the
RNA genome assembles with the N protein. This
RNA-protein complex then associates with M proteins and
buds into the lumen of the ER. The virus particles then
migrate through the Golgi complex and exit the cells
by exocytosis [7]. The first step in viral infection is the
binding of viral proteins to certain cellular receptors. So
far, the S protein of coronaviruses is considered to be the site
of viral attachment to the host cells [8,9].

The S proteins of coronaviruses are large type I
membrane glycoproteins that project from the viral envelope
[10]. S proteins are responsible both for binding to
receptors on host cells and for membrane fusion [11,12]. S
proteins also contain important virus-neutralizing
epitopes that elicit neutralizing antibody in the host species
[13,14]. Furthermore, mutations in the S gene
dramatically affect virulence, pathogenesis, and host cell
tropism [15-17]. These results suggested that S protein is
a good candidate for a vaccine because neutralizing
antibodies are directed against S. Moreover, S protein is also
a good target for antiviral therapies because blocking the
binding of S protein to its cellular receptor can prevent
virus entry. Therefore, in this study, we expressed the
SARS-CoV S protein in Escherichia coli (E. coli). The
antigenicity and receptor-binding ability of recombinant
S protein were further analyzed.

Materials and methods 

Construction of recombinant plasmids. SARS-CoV RNA was provided
by the Department of Medical Technology, China Medical University
Hospital. SARS-CoV S gene was kindly provided by Dr. P.J. Chen.
Briefly, the RNA was reverse transcribed into cDNA using SARS-M2
primer (5'-GGAATTCGCCAAACATACCAAGGCC-3') and SuperScript III
(Invitrogen) according to the manufacturer’s protocol. The
3580-bp DNA fragment was then amplified from the cDNA template by
polymerase chain reaction (PCR) using PfuUltra DNA polymerase
(Stratagene) with SARS-P1 primer (5'-CGGGATCCTAGTGGTAGTGACCTTGACC-3')
and SARS-M2 primer. The PCR profile was as
follows: one cycle at 95 °C for 2 min; 30 cycles at 95 °C for 30 s, 55 °C for
30 s, and 72 °C for 2 min. The amplified S cDNA fragments were cleaved
with EcoRI and BamHI, and the 3580-bp fragments were then inserted into
pET-28b(+) (Novagen) to create the pET-spike expression plasmid.
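
As a quick sanity check on the cloning design, the two primers can be scanned for the recognition sequences of the enzymes used for digestion. The sketch below is illustrative only and is not part of the original protocol: the primer sequences are those quoted above, the recognition sequences (GAATTC for EcoRI, GGATCC for BamHI) are the standard ones, and the helper name `find_sites` is hypothetical.

```python
# Illustrative check (not part of the original protocol): locate the EcoRI and
# BamHI recognition sequences in the two cloning primers quoted in the text.

PRIMERS = {
    "SARS-P1": "CGGGATCCTAGTGGTAGTGACCTTGACC",
    "SARS-M2": "GGAATTCGCCAAACATACCAAGGCC",
}

# Standard recognition sequences for the two enzymes named in the text.
SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC"}


def find_sites(seq, sites=SITES):
    """Return {enzyme: [0-based start positions]} for each site found in seq."""
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, find_sites(seq))
```

Under these assumptions, SARS-P1 carries the BamHI site and SARS-M2 the EcoRI site, each near the 5' end, which is consistent with directional insertion of the amplified fragment into the vector.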


DNA sequencing. DNA sequencing was performed on double-stranded
plasmids by dideoxy chain termination with the BigDye Terminator
Cycle Sequencing Ready Reaction kit (Applied Biosystems) and
analyzed on an ABI Prism 310 (Applied Biosystems). Dimethyl sulfoxide
was added to the reaction to a final concentration of 5% to diminish
secondary structure. The primer used in each sequencing reaction was
T7, SARS-P2 (5'-CGGGATCCTAATTACACCTGGAACAAATGC-3'),
T7 terminator, SARS-M2, SARS-M3 (5'-TTGATATAGAACAGCAACTTCAGATGAAGC-3'),
or SARS-M1 (5'-GGAATTCCACTTACACCCCCAAAAGAGC-3').
DNA was sequenced on both strands of
at least two repeats of the cloned DNA fragments.

Expression and purification of recombinant SARS-CoV S protein.
Recombinant protein was expressed in E. coli BL21(DE3)pLysS strain
transformed with pET-spike to produce an N-terminal fusion with
six histidine residues. The expression and purification of recombinant S
protein were performed as described previously with modification [18].
Briefly, cells were grown in 100 ml Luria-Bertani broth with agitation at
37 °C until OD600 reached 0.6. Isopropyl-β-D-thiogalactoside (IPTG)
was then added to a final concentration of 0.5 mM and the cells were
pelleted 2 h after induction. The cell pellet was washed twice with ice-cold
binding buffer (20 mM Tris-HCl, pH 7.9, 500 mM NaCl, and
5 mM imidazole) and resuspended in 5 ml ice-cold binding buffer
containing 8 M urea. The resuspended pellet was incubated at 4 °C
with shaking for 2 h. After sonication, the suspension was centrifuged
at 16,000g for 30 min at 4 °C. The resulting supernatant was subjected to
nickel-affinity chromatography with 8 M urea present throughout the
procedure. The protein in the final column eluate was dialyzed overnight
against phosphate-buffered saline (PBS) (137 mM NaCl, 2.7 mM
KCl, 4.3 mM Na2HPO4, and 1.4 mM KH2PO4) and stored at -70 °C
until further analysis. Protein was analyzed by sodium dodecyl sulfate-
polyacrylamide gel electrophoresis (SDS-PAGE) and was quantified
with a Bradford assay (Bio-Rad).
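
The buffer concentrations above translate into weighed masses by simple arithmetic (mass = molarity × volume × molar mass). The following is a minimal sketch of that calculation, not the authors' procedure: the molar masses are standard values that do not appear in the text, the 100 ml buffer volume is an arbitrary example, and `grams_needed` is a hypothetical helper. Tris-HCl is left out because the text does not say which form of Tris was weighed.

```python
# Minimal sketch: convert the stated buffer concentrations into masses.
# Molar masses (g/mol) are standard values, not taken from the text.
MOLAR_MASS = {
    "NaCl": 58.44,
    "imidazole": 68.08,
    "urea": 60.06,
}


def grams_needed(compound, conc_molar, volume_ml):
    """Mass of solute (g) for a target molar concentration in a final volume."""
    return conc_molar * (volume_ml / 1000.0) * MOLAR_MASS[compound]


if __name__ == "__main__":
    # Hypothetical 100 ml batch of binding buffer.
    print("NaCl, 500 mM in 100 ml:", round(grams_needed("NaCl", 0.5, 100), 3), "g")
    print("imidazole, 5 mM in 100 ml:", round(grams_needed("imidazole", 0.005, 100), 4), "g")
    # The 5 ml resuspension volume is the one stated in the text.
    print("urea, 8 M in 5 ml:", round(grams_needed("urea", 8.0, 5), 3), "g")
```

At 8 M, the urea itself occupies a large fraction of the final volume, so in practice the solid is dissolved first and the solution is then brought up to the final volume.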

Cells and cell extracts. Human lung carcinoma cells (A-549), human
liver cells (Chang liver), African green monkey kidney cells (Vero E6,
Vero), and human monocytes (U-937) were purchased from Bioresources
Collection and Research Center (Hsinchu, Taiwan). Cells were maintained
in Dulbecco’s modified Eagle’s medium supplemented with 10%
fetal bovine serum at 37 °C and 5% CO2 in air. The cell extracts were
prepared by washing the cells once with ice-cold PBS and lysing the cells
with lysis buffer (150 mM NaCl, 1% NP-40, 50 mM Tris-HCl, pH 8.0, and
20 µM phenylmethylsulfonyl fluoride). After a 30-min incubation on ice,
the cell lysate was collected by centrifugation. Protein was quantified
with a Bradford assay and stored at -30 °C until further analysis.
Fig. 1. Genome organization of SARS-CoV and putative region of SARS-CoV S protein. (A) Overall organization of the 29,727-nt SARS-CoV.
Predicted ORF 1a and ORF 1b encode replicase 1A and 1B, respectively. S, E, M, and N genes encode spike, envelope, membrane, and nucleocapsid
proteins, respectively. S gene is situated at nucleotides 21,492-25,259 in the SARS-CoV genome. (B) Predicted functional regions of SARS-CoV S
protein. The upper diagram represents the 1255-residue S protein. Regions spanning 1-13 and 1196-1255 are the putative signal peptide, and
transmembrane and cytoplasmic regions, respectively. Putative receptor-binding site is located at residues 757-761.
[Fig. 2, alignment panels: alignment positions 1-630 of the S proteins of SARS-CoV, HCoV-229E, TGEV, FIPV, HCoV-OC43, MHV, HEV, and BCoV, with a consensus line.]

Fig. 2. Multiple alignment of coronavirus S proteins. Amino acid sequences of S proteins from human respiratory coronavirus (HCoV-229E)
(GenBank Accession No. P15423), porcine transmissible gastroenteritis virus (TGEV) (GenBank Accession No. CAA37285), feline infectious
peritonitis virus (FIPV) (GenBank Accession No. P10033), human respiratory coronavirus (HCoV-OC43) (GenBank Accession No. AAA03055),
murine hepatitis virus (MHV) (GenBank Accession No. AAC56567), porcine hemagglutinating coronavirus (HEV) (GenBank Accession No.
AAM77000), and bovine coronavirus (BCoV) (GenBank Accession No. AAF25499) were aligned with that of SARS-CoV using the PileUp program.
Residues that are similar in all coronaviruses are shown in uppercase. The locations of the signal peptide, and transmembrane and cytoplasmic regions of
SARS-CoV S protein are underlined. The putative receptor-binding site of SARS-CoV S protein and antigenic sites of TGEV are shown in bold.
[Fig. 2 (continued), alignment panels: alignment positions 631-1526 of the S proteins of SARS-CoV, HCoV-229E, TGEV, FIPV, HCoV-OC43, MHV, HEV, and BCoV, with a consensus line.]


Biotinylation of recombinant S protein. Recombinant S protein was
mixed with Sulfo-NHS-biotin (Pierce) in a ratio of ten to one. After a
2-h incubation on ice, the unincorporated biotin was removed with a
Centricon-10 concentrator (Amicon) and the biotinylated S protein was stored at
4 °C until further analysis. Sulfo-NHS-biotin should be freshly
prepared by dissolving it in double-distilled water (DDW).

Western blot analysis. Proteins (5 µg) were separated by 10%
SDS-PAGE and the protein bands were then transferred electrophoretically
to nitrocellulose membrane (Amersham Pharmacia Biotech). The
membrane was blocked in blocking buffer (20 mM Tris-HCl, pH 7.6,
140 mM NaCl, 0.1% Tween 20, and 5% skim milk) and probed with
rabbit anti-spike (residues 511-993) polyclonal antibody or human
anti-SARS sera at room temperature for 1 h. The bound antibody was
detected with horseradish peroxidase-conjugated secondary antibody
(Sigma) followed by chemiluminescence (ECL system, Amersham) and
exposure to X-ray film. Rabbit anti-spike (residues 511-993) polyclonal
antibody was prepared by immunizing a rabbit with truncated spike and
was kindly provided by Dr. P.J. Chen. Human anti-SARS sera were
collected from SARS patients and provided by Dr. L.K. Chen.

For biotinylated Western blot, cell extracts were separated by 10%
SDS-PAGE and the protein bands were then transferred to a
membrane. After blocking, the membrane was probed with 0.1 µg/ml
biotinylated S protein at 4 °C for 1 h. The bound protein was detected with
peroxidase-conjugated avidin (Sigma) followed by chemiluminescence
and exposure to X-ray film.

Enzyme-linked immunosorbent assay (ELISA). Microtiter plates
(Nunc) were coated at 4 °C overnight with 50 µl of 10 ng/µl protein,
which was diluted in 0.05 M carbonate buffer (pH 9.6). The wells were
rinsed with 200 µl washing buffer (0.5% Tween 20 in PBS) and blocked
with 200 µl blocking buffer (5% bovine serum albumin (BSA) in
washing buffer) by incubating at 37 °C for 30 min. The adsorbed
protein in each well was challenged with 50 µl diluted human anti-SARS
sera and incubated at 37 °C for 1 h. After three washes with washing
buffer, 50 µl diluted peroxidase-conjugated secondary antibody was
added to each well and incubated at 37 °C for 1 h. Following three
washes, 50 µl chromogenic substrate, 2,2'-azinobis(3-ethylbenzthiazoline-sulfonic
acid), was added to each well and incubated at 37 °C for
15 min. The absorbance was read at 405 nm in an ELISA plate reader.
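
The coating step fixes the amount of protein per well, and preparing the coating solution is a C1·V1 = C2·V2 dilution. The sketch below works through that arithmetic; it is an illustration only, the function names are hypothetical, and the stock concentration and batch volume in the usage lines are invented for the example rather than taken from the text.

```python
# Illustrative arithmetic for the ELISA coating step.

def coating_mass_ng(volume_ul, conc_ng_per_ul):
    """Protein mass (ng) delivered to one well."""
    return volume_ul * conc_ng_per_ul


def stock_volume_ul(stock_ng_per_ul, final_ng_per_ul, final_volume_ul):
    """Volume of stock (µl) needed for a dilution, from C1*V1 = C2*V2."""
    if final_ng_per_ul > stock_ng_per_ul:
        raise ValueError("final concentration cannot exceed stock concentration")
    return final_ng_per_ul * final_volume_ul / stock_ng_per_ul


if __name__ == "__main__":
    # 50 µl of 10 ng/µl protein per well, as stated in the text.
    print("per well:", coating_mass_ng(50, 10), "ng")
    # Hypothetical: 1000 ng/µl stock, 5 ml of coating solution for one plate.
    print("stock needed:", stock_volume_ul(1000, 10, 5000), "µl")
```

With the values stated in the text, each well receives 500 ng of protein.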

For biotinylated ELISA, the procedure was performed as described
previously with modification. Briefly, microtiter plates were coated
with 5 µg/well cell extract, which was diluted in 0.05 M carbonate
buffer, blocked with blocking buffer, and challenged with 0.1 µg/well
biotinylated S protein. After a 1-h incubation at 37 °C, diluted
peroxidase-conjugated avidin was added and incubated at 37 °C for 1 h.
Following three washes, chromogenic substrate was added to each well
and incubated at 37 °C for 15 min. The absorbance was read at 405 nm
in an ELISA plate reader.


Results and discussion 

The SARS-CoV genome is approximately 29.7 kb
long and contains five major ORFs flanked by 5'- and 3'-
untranslated regions of 265 and 342 nucleotides,
respectively (Fig. 1A) [5,6]. The predicted S gene is located
from nucleotide 21,492 to 25,259 in the SARS-CoV
genome. In order to clone the S gene, the 3580-bp DNA
fragment containing the whole S gene was amplified by
reverse transcription-PCR and inserted into the prokaryotic
vector pET-28b(+). The deduced amino acid sequence of
S protein is shown in Fig. 2. The S gene had an ORF of 3768
nucleotides, capable of coding for a 1255-amino-acid
polypeptide of about 138 kDa. The amino terminus of
SARS-CoV S protein contained a short type I signal
peptide (residues 1-13) composed of hydrophobic amino
acids that are presumably removed during cotranslational
transport through the ER. The carboxyl terminus
(residues 1196-1255) consisted of a transmembrane
domain and a cytoplasmic tail rich in cysteine residues.
Twenty-three potential N-linked glycosylation sites were
predicted in the SARS-CoV S protein. Together these
data predicted that SARS-CoV S protein is a type I
membrane glycoprotein with the N-terminus and the
majority of the protein (residues 14-1195) on the outside
of the virus particle, in agreement with other coronavirus S
protein data (Fig. 1B).
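
The figures quoted in this paragraph can be cross-checked with a few lines of arithmetic. The sketch below is a hedged illustration, not the authors' method: it derives the ORF length and residue count from the stated genome coordinates, applies the common rule of thumb of about 110 Da per residue for the mass estimate, and scans for the standard N-X-S/T sequon (X not proline). The text does not state how the glycosylation sites were predicted, so `find_sequons` and the toy peptide are assumptions made for illustration.

```python
import re

AVG_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average, not a value from the text


def orf_summary(start_nt, end_nt):
    """Return (ORF length in nt, encoded residues excluding the stop codon)."""
    length_nt = end_nt - start_nt + 1
    return length_nt, length_nt // 3 - 1


def approx_mass_kda(n_residues):
    """Rough polypeptide mass from the average residue mass."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


def find_sequons(protein):
    """1-based positions of N in N-X-[S/T] motifs, where X is not proline."""
    # The lookahead lets overlapping motifs be reported as well.
    return [m.start() + 1 for m in re.finditer(r"(?=N[^P][ST])", protein.upper())]


if __name__ == "__main__":
    length_nt, residues = orf_summary(21492, 25259)
    print(length_nt, residues, round(approx_mass_kda(residues)))
    # Toy peptide (hypothetical), not a fragment of the S protein.
    print(find_sequons("MNGSAANPTNKT"))
```

The coordinates 21,492-25,259 give 3768 nucleotides and 1255 residues, and the rule-of-thumb mass comes out close to the 138 kDa stated above.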

Multiple alignment and phylogenetic analysis among
S proteins of mammalian coronaviruses, which cause
respiratory infection, were further performed with the PileUp
program of Genetics Computer Group (GCG) (Figs. 2
and 3). Alignment of these sequences showed a low
level of similarity (20-27% pairwise amino acid identity)
between the predicted amino acid sequence of
SARS-CoV S protein and other coronavirus S proteins.
Phylogenetic analysis showed that the species formed
monophyletic clusters consistent with established
taxonomic groups. However, SARS-CoV S protein
sequences segregated into a well-resolved branch,
indicating that SARS-CoV S protein is not closely
related to any of the previously characterized coronavirus
S proteins. Although overall sequence conservation was
low, the C-terminus, consisting of a transmembrane
domain and a cytoplasmic tail, was highly conserved.
Putative cellular receptor-binding sites (residues 757-761)
of SARS-CoV S protein [19] were not conserved
compared with other coronaviruses. The antigenic sites of
porcine transmissible gastroenteritis virus S protein
[20,21] also varied among these coronaviruses.
These data suggested that comparison of primary amino
acid sequences does not provide insight into the
receptor-binding specificity or antigenic properties of
SARS-CoV S protein.
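
Pairwise amino acid identity, as quoted above, can be computed directly from two rows of an alignment. The following is a minimal sketch under stated assumptions: identity is taken as identical residues divided by the number of columns in which neither sequence has a gap, which is one common convention and not necessarily the one used by the GCG programs. The function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap_chars=".-"):
    """Percent identity over alignment columns where neither row has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x in gap_chars or y in gap_chars:
            continue  # skip columns containing a gap in either row
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


if __name__ == "__main__":
    # Toy aligned rows (hypothetical), not taken from Fig. 2.
    print(percent_identity("ACD.EF", "ACE-EF"))
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, so values from different tools are not always directly comparable.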

By analyzing the primary structure of SARS-CoV S protein,
we sought to predict its putative antigenic regions. The
hydrophilicity, surface probability, and chain flexibility
of SARS-CoV S protein were calculated with the
PeptideStructure program of GCG according to Kyte and
Doolittle plots, the Emini prediction, and the Karplus and
Schulz prediction, respectively (Fig. 4A) [22-24]. The
antigenicity of SARS-CoV S protein was further analyzed
based on the three aforementioned criteria, and the
regions in which the antigenic index exceeds 1.3 are shown
in Fig. 4B. Most of the putative antigenic sites were
located in the outer membrane regions, among which
residues 1-100 and 401-500 exhibited high antigenic
potential. Because most T cell and B cell epitopes contain
a sequence of 6-20 amino acids [25,26], we further
analyzed the antigenicity using a window of 15 residues.
Residues 12-50, 426-456, 478-494, 541-564, and 922-1118 of
SARS-CoV S protein displayed high antigenic potential.
These data suggested that these regions might be good
candidates for developing a SARS peptide vaccine.
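
The windowed analysis described above can be sketched as follows. This is a minimal illustration, not the GCG PeptideStructure algorithm: it averages the Kyte-Doolittle hydropathy scale over a sliding window (15 residues by default, matching the window used here) and reports contiguous runs of a per-position score that exceed a threshold such as 1.3. The function names are assumptions, and the score passed to `regions_above` would in practice be an antigenic index combining several criteria, not hydropathy alone.

```python
# Kyte-Doolittle hydropathy scale (positive = hydrophobic, negative = hydrophilic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_mean(seq, window=15):
    """Mean Kyte-Doolittle hydropathy for each full window along the sequence."""
    values = [KD[aa] for aa in seq.upper()]
    if window < 1 or window > len(values):
        return []
    total = sum(values[:window])
    means = [total / window]
    for i in range(window, len(values)):
        total += values[i] - values[i - window]  # slide the window by one residue
        means.append(total / window)
    return means


def regions_above(scores, threshold):
    """1-based inclusive (start, end) runs of positions whose score exceeds threshold."""
    regions = []
    start = None
    for i, score in enumerate(scores, start=1):
        if score > threshold and start is None:
            start = i
        elif score <= threshold and start is not None:
            regions.append((start, i - 1))
            start = None
    if start is not None:
        regions.append((start, len(scores)))
    return regions
```

Strongly negative window means mark hydrophilic stretches, which tend to be surface-exposed and are therefore more likely to be antigenic.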

Fig. 3. Phylogenetic analysis of coronavirus S proteins. Amino acid sequences of SARS-CoV S proteins were compared with those of HCoV-229E,
TGEV, FIPV, HCoV-OC43, MHV, HEV, BCoV, avian infectious bronchitis virus (IBV) (GenBank Accession No. P11223), and turkey coronavirus
(TCoV) (GenBank Accession No. AAQ20922). Sequence alignment and neighbor-joining tree were generated by the GrowTree program of GCG using
the BLOSUM comparison matrix. Coronavirus groups I, II, and III are indicated. Branch lengths are proportional to amino acid differences.

Fig. 4. Analysis of the primary sequence of SARS-CoV S protein. (A) The hydrophilicity, surface probability, and chain flexibility of SARS-CoV S
protein were analyzed by the PeptideStructure program of GCG. (B) Antigenic analysis of SARS-CoV S protein. The secondary structure and putative
antigenic sites of SARS-CoV S protein were analyzed by the PeptideStructure program of GCG using the Chou-Fasman prediction. Secondary structure
was drawn with the PlotStructure program. The predicted antigenic sites are shown as octagons. The putative glycosylation sites are shown as
balloons.

To study the antigenicity and receptor-binding ability of
SARS-CoV S protein, we expressed and purified the
full-length recombinant SARS-CoV S protein from E. coli.
The S protein was expressed in the E. coli BL21(DE3)pLysS
strain transformed with a pET plasmid carrying the S gene.
After induction with IPTG, a product of 138 kDa was
observed by SDS-PAGE analysis (Fig. 5). The amount of
induced S protein was consistent when the bacteria were
refreshed to 0.45-0.6 OD600, and the recombinant S protein
was expressed in the soluble form in bacterial cells. The
soluble S protein was further purified by affinity
chromatography using His-Bind resin. No detectable S
protein was recovered, suggesting that the histidine tag
is folded into the interior of S protein (data not shown).
We therefore denatured the S protein with urea, purified
it by nickel-affinity chromatography, and renatured the
protein by dialysis. The amount of recombinant S protein
recovered was approximately 0.2-0.3 mg/100 ml of bacterial
culture (Fig. 6).

Fig. 5. Optimal condition and distribution of recombinant SARS-CoV
S protein in E. coli. (A) Optimal condition for expressing recombinant
S protein. The E. coli BL21(DE3)pLysS strain containing pET-spike
was cultured at 37 °C with shaking. After an overnight incubation,
the bacteria were refreshed to 0.45 (lane 3), 0.5 (lane 4), 0.55 (lane 5), or
0.6 OD600 (lane 6), and IPTG was then added to induce protein
expression. The preparations of uninduced (lane 2) and induced E. coli
(lanes 3-6) were analyzed by 10% SDS-PAGE and stained with
Coomassie brilliant blue. (B) Distribution of recombinant S protein in
E. coli. After IPTG induction, the total proteins (lane 3), soluble
fraction (lane 4), and insoluble fraction (lane 5) of induced E. coli were
analyzed by 10% SDS-PAGE and stained with Coomassie brilliant blue.
Lane 2 represents the preparation of uninduced E. coli. The molecular
masses of the protein standards (lane 1) are shown at the left. The 138-kDa
recombinant S protein is indicated by the arrow.

Fig. 6. SDS-PAGE analysis of recombinant SARS-CoV S protein. The
total proteins (lane 3), soluble (lane 4), unbound (lane 5), and purified
fractions (lane 6) of induced E. coli were analyzed by 10% SDS-PAGE
and stained with Coomassie brilliant blue. Lane 2 represents the
preparation of uninduced E. coli. The molecular masses of the protein
standards (lane 1) are shown at the left. The 138-kDa recombinant S
protein is indicated by the arrow.

To analyze the antigenicity of recombinant S protein, we
performed Western blot and ELISA using sera from SARS
patients or from spike-immunized rabbits. Fig. 7 shows
that recombinant S was detectable by both sera in both
assays. Additionally, a truncated S protein (residues
511-993) exhibited antigenicity similar to that of
full-length spike (residues 1-1255). These results
indicated that recombinant S protein retains its
antigenicity and can be recognized by sera from SARS
patients.
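
The 138-kDa product observed by SDS-PAGE can be checked against the length of the full-length construct (residues 1-1255) with a back-of-the-envelope estimate. The sketch below assumes an average residue mass of about 110 Da, a common rule of thumb rather than a value taken from this study, and ignores the histidine tag and any vector-encoded residues; the function names are introduced only for this example.

```python
# Rule-of-thumb average mass of one amino acid residue, in daltons (an assumption).
AVERAGE_RESIDUE_MASS_DA = 110.0


def span_length(first, last):
    """Number of residues in an inclusive residue range."""
    if last < first:
        raise ValueError("last residue must not precede first residue")
    return last - first + 1


def estimated_mass_kda(n_residues, residue_mass_da=AVERAGE_RESIDUE_MASS_DA):
    """Approximate polypeptide mass in kDa from its residue count."""
    if n_residues < 0:
        raise ValueError("residue count must be non-negative")
    return n_residues * residue_mass_da / 1000.0


full_length_kda = estimated_mass_kda(span_length(1, 1255))   # about 138 kDa
truncated_kda = estimated_mass_kda(span_length(511, 993))    # 483 residues
print(f"full length: {full_length_kda:.1f} kDa, truncated: {truncated_kda:.1f} kDa")
```

Under this assumption the full-length estimate agrees with the observed 138-kDa band, which is consistent with expression of an unglycosylated polypeptide in E. coli.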

Fig. 7. Antigenic analysis of recombinant SARS-CoV S protein. (A)
Western blot analysis. The preparations of uninduced E. coli (lanes 1
and 4), induced E. coli (lanes 3 and 6), and purified S protein (lanes 2
and 5) were analyzed by 10% SDS-PAGE, transferred to nitrocellulose
membrane, detected by rabbit anti-spike antibody (lanes 1-3) or
human anti-SARS-CoV antibody (lanes 4-6), and subsequently
visualized by chemiluminescence. (B) ELISA. The BSA and recombinant S
proteins (residues 1-1255 and residues 511-993) were coated on ELISA
plates and detected by human anti-SARS-CoV antibody. Values are
means ± standard error of triplicate assays.

Fig. 8. Receptor-binding ability of recombinant S protein. (A)
Biotinylated ELISA. The BSA, Vero cell extract, and Vero E6 cell extract were
coated on ELISA plates and detected by biotin-labeled S protein. Values
are means ± standard error of triplicate assays. (B) Biotinylated
Western blot. The cell extracts of Vero E6 (lane 1), Vero (lane 2), A-549
(lane 3), Chang liver (lane 4), and U-937 (lane 5) were separated by
SDS-PAGE, transferred to membrane, and detected by biotin-labeled S
protein. The molecular masses of the protein standards are shown at the left.
The proteins present in Vero cells only are indicated by arrowheads.

The receptor-binding ability of recombinant S protein was
analyzed by biotinylated ELISA and Western blot. Several
reports indicated that SARS-CoV replicates and produces
progeny virus in Vero cells [3,5,6]. BSA and Vero cell
extracts were therefore coated on ELISA plates and
challenged with biotin-labeled S protein. The binding of S
protein to cell extract was evaluated by the OD405 value
in ELISA. Fig. 8A shows that binding of S protein to BSA
was very low. However, S protein bound Vero cell extracts
significantly, with the OD405 exceeding 0.9. These results
demonstrated the specificity of the biotinylated ELISA in
analyzing the receptor-binding ability of S protein. They
also suggested that recombinant S protein could serve as
a probe to analyze the cellular receptors involved in
virus attachment.
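
The ELISA values in Figs. 7B and 8A are reported as means ± standard error of triplicate assays. A minimal sketch of that summary statistic is given below; the function name is an assumption, and the three readings are made-up numbers for illustration only, not data from this study.

```python
import math
import statistics


def mean_and_sem(readings):
    """Return (mean, standard error of the mean) for replicate readings."""
    if len(readings) < 2:
        raise ValueError("at least two replicates are needed for a standard error")
    mean = statistics.fmean(readings)
    # Sample standard deviation (n - 1 denominator) divided by sqrt(n).
    sem = statistics.stdev(readings) / math.sqrt(len(readings))
    return mean, sem


# Hypothetical triplicate OD405 readings (illustrative values, not measured data).
example_readings = [0.2, 0.4, 0.6]
mean, sem = mean_and_sem(example_readings)
print(f"{mean:.3f} ± {sem:.3f}")
```

With only three replicates the standard error is a coarse estimate of variability, so comparisons between wells such as BSA and Vero cell extract are best read alongside the raw values.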

To further identify the potential cellular receptors for
SARS-CoV attachment, we performed biotinylated Western
blot. The cell extracts were separated by SDS-PAGE,
transferred to membrane, and detected by biotin-labeled S
protein. Fig. 8B shows that recombinant S protein
interacted with several cellular proteins in different
cell types. By comparing the protein patterns of different
cell types, we found that two proteins with molecular
masses of 130 and 140 kDa were detectable in Vero cells
but not in the other cell types. At present, Vero is the
only cell line known to be infected by SARS-CoV [3]. These
results suggested that the 130-kDa and 140-kDa
polypeptides in Vero cells might be the cellular receptors
responsible for SARS-CoV S protein binding. Determination
of the amino acid sequences of these proteins is in
progress.

In this study, we cloned, expressed, and purified the
SARS-CoV S protein from E. coli. The recombinant S
protein was expressed in soluble form in bacterial cells,
and the amount of protein recovered was 0.2-0.3 mg/100 ml
of bacterial culture. The S protein was recognized by
sera from SARS patients, indicating that recombinant S
protein retained its antigenicity. By biotinylated ELISA
and Western blot using recombinant S protein as the
probe, we identified 130-kDa and 140-kDa proteins that
might be the cellular receptors responsible for SARS-CoV
infection. These results suggested that S protein could be a
good candidate for further development of a SARS vaccine
and anti-SARS therapy.